Thresholding, Noise Reduction and Skew correction of Sinhala Handwritten Words
نویسندگان
چکیده
The Sinhala script, which is generally with round characters, is unique among other Brahmi-descended scripts and is used by 70% of the 18 million populations in Sri Lanka. There has been no published research on the cursive unconstrained Sinhala handwriting recognition. This paper proposes vital preprocessing stages, which are categorized under thresholding, noise removal, skew detection and correction algorithm and is useful to improve accuracy of segmentation and recognition of the Sinhala handwritten words. This paper introduces a novel skew detection method based on least square method and also robust indirect skew correction method of unconstrained cursive Sinhala words. Threshold selection is used in combination with three methods such as QIR, NIR, and analyzing gray level intensity distribution. Median filtering algorithm and connected component analysis method are used to reduce the noise in Sinhala handwritten word images and used vertical projection profile histogram based on validation technique to improve the noise reduction of the image. In this proposed system over 700 handwritten patterns in Sinhala handwritten real postal addresses in NSF database [3] were tested and the reported accuracy was 97.2% and 500 handwritten patterns, which were written by undergraduate students, were tested and the reported accuracy was 99.6%. Overall accuracy of 98.4% was reported using these two types of handwritten patterns.
منابع مشابه
New Preprocessing Techniques for Handwritten Word Recognition
The research described in this paper focuses on the presentation of two novel preprocessing techniques for the task of off-line handwritten word recognition. A technique for the identification of straight and skewed underline noise is described along with a novel algorithm for detecting skew in handwritten words. The latter identifies skew by detecting the center of mass in each half of a word ...
متن کاملNew Preprocessing Techniques for Handwritten Word Recognition
The research described in this paper focuses on the presentation of two novel preprocessing techniques for the task of off-line handwritten word recognition. A technique for the identification of straight and skewed underline noise is described along with a novel algorithm for detecting skew in handwritten words. The latter identifies skew by detecting the center of mass in each half of a word ...
متن کاملSkew Detection/Correction and Local Minima/Maxima Techniques for Extracting a New Arabic Benchmark Database
We propose a set of techniques for extracting a new standard benchmark database for Arabic handwritten scripts. Thresholding, filtering, and skew detection/correction techniques are developed as a pre-processing step of the database. Local minima and maxima using horizontal and vertical histogram are implemented for extracting the script elements of the database. Elements of the database contai...
متن کاملAngular Skew Correction Algorithm for Handwritten Hindi Text
In large scale document digitalization of hand written and printed documents, they are scanned and stored in digital form. Since many of these documents are hand written they have errors like angular skewness of words. Skew detection and removal is a part of pre-processing before using OCR software to digitalize the documents. One type of skewness that is most difficult to detect and correct is...
متن کاملA new algorithm for skew detection and correction
Here we have proposed two algorithms. The first one detects the skewing of words and the second corrects the skewing from handwritten words. Both algorithms make use of the Radon transform based projection profile technique. The method does not require pre-processing and it works equally good even with noise. The method is fast. The algorithms have been tested on words taken from more than 200 ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005